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ABSTRACT 

Argonaute (Ago) proteins function in RNA silencing 
as components of the RNA-induced silencing 
complex (RISC). In lower organisms, the small 
interfering RNA and miRNA pathways diverge due 
in part to sorting mechanisms that direct distinct 
small RNA (sRNA) duplexes onto specific Ago- 
RISCs. However, such sorting mechanisms appear 
to be lost in mammals. miRNAs appear not to dis- 
tinguish among Ago1-4. To determine the effect of 
viral infection on the sorting system, we compared 
the content of deep-sequenced RNA extracted from 
immunoprecipitation experiments with the Ago1 and 
Ago2 proteins using Epstein-Barr virus (EBV)- 
infected cells. Consistent with previous observa- 
tions, sequence tags derived from miRNA loci in 
EBV and humans globally associate in approxi- 
mately equivalent amounts with Ago1 and Ago2. 
Interestingly, additional sRNAs, which have not 
been registered as miRNAs, were associated with 
Ago1. Among them, some unique sequence tags 
derived from tandem loci in the human genome as- 
sociate exclusively with Ago1 but not, or rarely, with 
Ago2. This is supported by the observation that the 
expression of the unique sRNAs in the cells is highly 
dependent on Ago1 proteins. When we knocked 



down Ago1, the expression of the Ago1 -specific 
sRNAs decreased dramatically. Most importantly, 
the Ago1 -specific sRNAs bound to mRNAs and 
regulated target genes and were dramatically 
upregulated, depending on the EBV life cycle. 
Therefore, even in mammals, the sorting mechanism 
in the Ago1-4 family is functional. Moreover, the ex- 
istence of Ago1 -specific sRNAs implies vital roles in 
some aspects of mammalian biology. 

INTRODUCTION 

miRNAs are a recently discovered class of small 
noncoding RNAs that are 18-24 nucleotides long and 
that downregulate target genes at the posttranscriptional 
level. The majority of miRNA genes are transcribed by 
RNA polymerase II into long primary miRNA tran- 
scripts, processed by the nuclear nuclease Drosha into 
~60-bp hairpins, termed precursor (pre) miRNAs, and 
further cleaved in the cytosol by Dicer nuclease into 
mature miRNAs. Mature miRNAs are then incorporated 
into the multiprotein RNA-induced silencing complex 
(RISC), exerting posttranscriptional repression of 
target mRNAs, either by inducing mRNA cleavage 
and mRNA degradation or by blocking mRNA transla- 
tion (1). 

RNAi plays a critical role in innate cellular defence 
against viruses. In plants and invertebrates, viral RNA 
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genomes and mRNAs are targeted for destruction by the 
stimulated production of small interfering RNA (siRNAs) 
derived from viral double-stranded RNAs (2,3). Given the 
strong type I interferon-based antiviral response initiated 
by the intracellular double-stranded RNA sensors RNA- 
activated protein kinase (PKR), Retinoic acid-inducible 
gene-I (RIG-I) and Melanoma differentiation-associated 
gene 5 (MDA-5) in mammals, the siRNAi pathway may 
have been suppressed as a host protection mechanism 
against RNA viruses (4). Recently, analogous viral 
siRNAs were detected within the pool of small RNAs 
(sRNAs) isolated from RNA-virus-infected mammalian 
Embryonic stem (ES) cells that exhibited a low-level inter- 
feron-based antiviral response. 

In lower organisms, the siRNA and miRNA pathways 
diverge in part due to sorting mechanisms that direct 
distinct sRNA duplexes into specific Argonaute (Ago)- 
RISCs (5). However, such sorting mechanisms appear to 
be lost in mammals. miRNAs appear not to distinguish 
among Ago 1-4, while only Agol and Ago2 prefer siRNAs 
(6). In contrast, influenza A virus infected Ago 1 and 3 
double-knockout mice exhibited increased mortality, con- 
sistent with more severe alveolitis and pneumonitis, 
indicating that optimal resistance to influenza requires 
Ago 1 and/or 3. Enhanced mortality of double-knockout 
mice was not associated either with increased viral repli- 
cation or with differential pulmonary recruitment or 
function of innate and adaptive immune cells; therefore, 
its function in RNAi targeting against virus-coding RNAs 
has yet to be demonstrated. The results show that while 
miRNAs may not distinguish among Ago 1-4, this may 
not be the case for other sRNAs (7). Epstein-Barr virus 
(EBV), a member of the y-herpes virus family, was found 
to be widespread in all human populations and to persist 
in the vast majority of individuals as a lifelong, asymp- 
tomatic infection of the B-lymphocyte pool. It is usually 
the cause of infections that are not apparent, though it 
may cause infectious mononucleosis. The more severe, 
albeit rare, result of EBV infection is malignant transform- 
ation and cancer development in various forms, including 
Burkitt's lymphoma and nasopharyngeal carcinoma, one 
of the most common cancers in China. As a ubiquitous 
human pathogen, EBV-associated lymphoid malignancies 
include a subset of Burkitt's lymphoma, AIDS lymphoma, 
Hodgkin's lymphoma, posttransplant lymphoma, age- 
associated B-cell lymphoma and peripheral T- and NK- 
cell lymphoma (8). 

Like other herpes viruses, EBV infection can exhibit two 
distinct patterns or states of gene expression. During acute 
EBV infection, the virus sequentially expresses its entire 
repertoire of genes, producing a lytic infection. In this lytic 
program, linear double-stranded genomes are produced 
and packaged as virions that spread the infection from 
cell to cell. In the latent program, few viral genes are 
transcribed, no viral progeny are produced and infected 
cells are protected from apoptotic stimuli and in some 
circumstances driven to proliferate (8). 

EBV was the first human virus found to encode micro 
RNAs. EBV encodes 44 viral micro RNAs and one small- 
RNA. EBV-encoded micro RNAs originated from the 
Bam HI fragment H rightward open reading frame 1 



(BHRF1) and Bam HI A region rightward transcript 
(BART) loci of the EB viral genome. These viral micro 
RNAs play a vital role in immunogenesis, host cell 
survival and proliferation, differentiation, 
lymphomagenesis and modulating the states of viral infec- 
tion and latency (8). 

To determine whether the viral-encoded RNAs are se- 
lectively sorted and if the sorting system is affected by 
viral infection, we compared the content of deep- 
sequenced RNA extracted from immunoprecipitation 
(IP) experiments with the Agol and Ago2 proteins using 
EBV-infected cells. The EBV-encoded miRNAs are 
incorporated equally into Agol and Ago2 and identified 
a novel class of sRNAs that are preferentially 
incorporated into Agol but not Ago2. In this study, we 
investigated the abundance, expression patterns, sequence 
characteristics and functions of the novel Agol -specific- 
associated sRNAs. 

MATERIALS AND METHODS 

Cells 

The EBV-positive and -negative Hodgkin's lymphoma cell 
lines, L591 and LI 236 (9), and Akata (+) cells and acute 
monocytic leukemia cell lines, THP-1, were maintained in 
RPMI1640 medium (Nacalai tesque) supplemented with 
10% (v/v) fetal bovine serum (FBS), 50U/ml penicillin 
and 50 ug/ml streptomycin in a 50-ml flask (Sumilon). 
Human carcinoma of cervix cell line, HeLa, was main- 
tained in Dulbecco's modified Eagle's medium (Nacalai 
tesque) supplemented with 10% (v/v) FBS, 50U/ml peni- 
cillin and 50 ug/ml streptomycin in a 10-cm dish (Corning, 
Inc., Corning, NY, USA). Cells were passaged twice per 
week. 

Immunoprecipitation 

Small RNAs incorporated into Agol and Ago2 were 
immunoprecipitated from cell lysates using the 
microRNA Isolation Kit (human/mouse Agol and 
human Ago2) (Wako, Osaka, Japan) according to the 
manufacturer's protocol. 

Next-generation sequencing 

Immunoprecipitated sRNAs were converted to sequence 
libraries using the sRNA sample preparation kit 
(Illumina), according to the manufacturer's instructions. 

The libraries were sequenced using an Illumina GA IIx 
with the single 36-bp read option. Adapter sequences were 
removed from the generated sequence data. The adapter- 
trimmed 20-24-bp sequences were mapped to the human 
hgl9 genome assembly and EBV genome using the 
sequence alignment software Eland (Illumina). 

Mapped sequences without any mismatches were used 
for further analysis. Redundantly mapped sequences that 
were <256 positions in the genomes were counted as 1 
sequence at all possible mapped positions. 
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siRNA and transfection 

siRNA targeting human Agol and DROSHA was 
purchased from OriGene Technology, Inc. (MD, USA) 
and Cosmo Bio Co., Ltd. (Tokyo, Japan), respectively. 
siRNA against human Ago2 was kindly provided by Dr 
N. Kosaka (National Cancer Center Research Institute, 
Tokyo, Japan). L591 cells, LI 236 cells and Akata cells 
were suspended in resuspension buffer at a density of 
2 x 10 7 /ml and siRNA or the negative control synthetic 
RNA (Bioneer, Inc., CA, USA) was added at a final con- 
centration of 1 uM (for Akata cells, 500 nM). The trans- 
fection of siRNA into cells was performed with the Neon® 
transfection system (Invitrogen, co., CA, USA) according 
to the manufacturer's protocol. Cells were cultured for 3 
or 6 days in RPMI1640 medium supplemented with 10% 
(v/v) FBS. 

Sequences of specific siRNAs were described in 
Supplementary Table SI. 

Quantitative polymerase chain reaction for Agotaxis 
sRNAs 

Total RNA was prepared from cells with Sepasol-RNA I 
Super G (Nakarai Tesque, Inc., Kyoto, Japan) and 
reverse-transcribed with the miScript II RT Kit (Qiagen, 
Hilden, Germany) to synthesize complementary DNA of 
sRNAs. The sRNAs were polyadenylated by poly (A) 
polymerase, then, bound by oligo-dT primer that have a 
5' universal tag sequence and 3' degenerate anchor and 
reverse-transcribed by miScript Reverse Transcriptase. 
The cDNA with the universal tag was amplified by its 
complementary sequence and the gene-specific forward 
primers. Real-time polymerase chain reaction (PCR) was 
performed using the StepOne software and miScript 
SYBR Green PCR Kit (QIAGEN). Each of 2xSYBR 
Master Mix 5 jil, RNase-free water 2 ul, 5 uM target 
primer 1 jil and 1 (il of 10 x universal primer mixture 
were added to 1 jil of water-soluble sample of cDNA. 
The reaction was run at 95°C for lOmin, followed by 40 
cycles at 95°C for 15 s and 60°C for 1 min. miR-21, miR- 
155, RNU6 and candidate sRNAs were detected by quan- 
titative PCR (qPCR) using the miScript SYBR Green 
PCR Kit (Qiagen). Sequences of specific primers were 
described in Supplementary Table SI. 

Quantitative PCR for genes 

For target gene detection, RT-PCR was performed using 
the High Capacity cDNA Reverse Transcription Kit 
(Applied Biosystems, Inc., CA, USA) and qPCR was 
carried out with the Fast SYBR Green Master mix. All 
real-time qPCR was conducted using the StepOnePlus 
real-time PCR system (Applied Biosystems). Threshold 
cycle (CT) values were calibrated to Glycer aldehyde- 3- 
phosphate dehydrogenase (GAPDH) and analyzed by 
the 2- A ACT method. Statistical analysis was done by 
student t-test. Sequences of specific primers were described 
in Supplementary Table SI. 



Dual luciferase assay 

In each well of a 96-well plate, 293T cells were 
cotransfected with 30 ng of psiCHECK-2s (Promega) 
and 90 ng of siRNA. The siRNAs were purchased from 
Cosmo Bio Co., Ltd. (Tokyo, Japan). Sequences of the 
siRNAs for Dual luciferase assay were described in 
Supplementary Table SI. 

After 48 h of transfection, the relative amounts of 
Renilla and firefly luciferase were determined by dual- 
luciferase assay (Promega). The Renilla/firefly luciferase 
ratio was calculated and normalized against the control. 

Gene expression analysis 

RNA from cells used for microarray analysis was isolated 
using the RNeasy Mini Kit (Qiagen, Hilden, Germany). 
For microarray analysis, splenocytes were cultured for 
72 h with or without lOuM IM. Gene expression micro- 
array analysis was performed using two-colour micro- 
array-based gene-expression analysis (Agilent 
Technologies, Santa Clara, CA, USA) according to the 
manufacturer's instructions. After scanning, expression 
values for the genes were determined using 
GeneSpringGX software. All experiments were performed 
in duplicate. 

Analysis of CLASH data 

Sequencing data generated with the CLASH technique 
and recently published (ref., GEO accession 
GSM 1219490) were analyzed as previously reported (10) 
with the following modifications: 

(1) Custom BLAST database was supplemented with the 
sequences of Agotaxis sRNAs (ASR1: gagaaagctca 
caagaactgct; ASR2: ccccccactgctaaatttgactg; ASR3: 
tcccactgcttcacttgactagc; ASR4: tcccccactgctaaatttgact 
gg; ASR5: aagcagggtcgggcctggt; ASR6: gggaataccggg 
tgctgtaggc). 

(2) We selected all chimeras such that one bit was 
mapped to an ASR and the other bit was uniquely 
assigned to a protein coding gene (multiple tran- 
scripts allowed). Overview diagrams of ASRs- 
mRNA chimera preparation are presented in 
Figure 5b. 



Lytic phase induction assay 

Akata (+) cells were stimulated with rabbit anti-human 
IgG polyclonal antibody (final 20 |ig/ml) (Dako) at 37°C 
in a C0 2 incubator for 15-40h and induced into lytic 
phase. Cells were counted before and after stimulation 
and then the numbers of dead cells were calculated. For 
Agol knocked down experiments, siRNAs were 
electroporated into the cells by Neon® transfection 
system 2 days before anti-human IgG stimulation and 
the cells were cultured for 48 h. After stimulation, cells 
were harvested and total RNAs were prepared with 
Sepasol-RNA I Super G. To quantify sRNA expression, 
RT-PCR was performed using the miScript II RT kit, and 
qPCR using the miScript SYBR Green PCR kit. For 
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target gene detection, RT-PCR was performed using the 
high-capacity RNA-to-cDNA kit and qPCR was carried 
out with the Fast SYBR Green Master mix. All real-time 
qPCR was conducted using the StepOnePlus real-time 
PCR system. 

RESULTS 

Ago-associated sRNA libraries indicate that equivalent 
amounts of human and EBV-encoded miRNAs associated 
with Agol and Ago2 

We analyzed the short RNAs from immunoprecipitated 
Agol and Ago2 proteins isolated from L591 cells derived 
from EBV-positive Hodgkin's lymphoma cells. Raw tags 
from the generated data set were mapped to the hgl9 
human genome assembly using the procedures outlined 
in the 'Materials and Methods' section. 

Notably, 30-40% of the total sRNAs in the latent- 
infected EBV cells were derived from the EBV genome, 
as was reported previously (4,11) (Figure la). 

The majority of tags were mapped to regions annotated 
as both human and EBV miRNAs. Consistent with 
previous observations, the top 50 abundant sequence 
tags derived from the miRNA loci in both humans and 
EBV globally associate with Agol and Ago2 in approxi- 
mately equivalent amounts (Figure lb and c). 
Interestingly, tags that were derived from loci other than 
miRNA (non-miRNA loci) were found more often in the 
Agol than in the Ago2 IP sRNA library (26 versus 6%) 
(Figure Id). 

A new class of sRNAs selectively associated with Agol 

Interestingly, some unique sequence tags from non- 
miRNA loci, derived from tandem loci in the human 
genome, predominantly associate with Agol but not, or 
only rarely, with Ago2. The definition of tandem loci is as 
follows: when one locus has a read count of >10, both 
flanking loci located within 100 bases from the first locus 
also have >10 reads (Figure 2a). The typical loci are 
shown in Figure 2b. More than 2000 counted RNAs are 
listed in Figure 2c. We named the sRNAs as Agotaxis 
small RNAs (ASRs). The representative ASRs were 
further investigated (Figure 2d). The results for several 
of these sRNAs by next-generation sequencing were 
validated by real-time PCR, which showed that the 
sRNA levels were significantly higher in Agol- than 
Ago2-immunoprecipitated samples (Figure 2e). The ex- 
pression of the sRNAs in L591 cells was confirmed by 
the northern blot analysis (12) (Supplementary Figure). 

Downregulation of DROSHA does not lead to the 
decrease of Agotaxis 2, 3 and 4 in L591 cells 

From the registered sequence of ASRs transcripts shown 
in Figure 3 a, we predicted their folding structures by the 
mfold web server (http://mfold.rna.albany.edu/7q 
= mfold/RNA-Folding-Form). Every ASR potentially 
form secondary hairpin loop structures (Figure 3b). We 
performed 5' RACE and found only ASR1 transcript con- 
tained the flanking regions (data not shown). In the 



canonical pathway of miRNAs processing, the flanking 
regions of primary miRNAs are cleaved by DGCR8/ 
DROSHA complex. We evaluated the expression of 
ASRs when DROSHA was knocked down in L591 cells 
by siRNA against DROSHA (siDROSHA). The expres- 
sion of DROSHA was significantly decreased 3 days after 
the transfection (Figure 3c). The amounts of ASRs and 
mature miR-21 were evaluated with real-time PCR 6 days 
after the transfection (Figure 3d). While miR-21 was sig- 
nificantly decreased in siDROSHA-transfected L591 cells, 
the expressions of ASRs except ASR1 did not change 
(Figure 3d). About 30% reduction in ASR1 was con- 
firmed (Figure 3d). The result suggested DROSHA is 
not involved in the processing of ASR2, 3 and 4. 

Characteristic features of the first nucleotide preference 

The 5' ends of miRNAs and piRNAs are mostly uridine, 
which likely binds to Ago/Piwi proteins (13). The 5' ends 
of the top 500 abundantly expressed ASRs were analyzed. 
A large proportion of the 5' ends of the ASRs are C, G or 
A (82.2%), with uridine being a minor component 
(17.8%). Therefore, these binding proteins are differ- 
ent from those associated with canonical miRNAs 
(Figure 4a). The composition of the nucleotides of ASRs 
is slightly biased to C [A (23.7%):G (25.0%):C (26.9%):U 
(24.5%); Figure 4b]. We investigated the position-specific 
motif for the top 500 abundantly expressed ASRs using 
the MEM software (http://meme.sdsc.edu). The top con- 
sensus motif of 37 ASRs from the 500 ASRs was CACU 
at the 5' end +2 and UUGACU at the 3' end (Figure 4c). 
The alignment of 37 ASRs was shown in Supplementary 
Figure SI. We observed substantial sequence variation in 
ASRs. Some of the isoforms of ASRs had comparable 
abundances with major ASRs (Figure 2c). 

Expression profiles of ASRs 

Expression of ASRs in the peripheral blood mononuclear 
cell (PBMC), THP-1, HeLa and L1236 cells, which were 
derived from Hodgkin's lymphoma cells but were not 
EBV infected, were analyzed. All cells and cell lines 
express ASRs. The expression of ASRs is not restricted 
to EBV-infected cell lines, but seemed to be ubiquitous 
(Figure 4d). Moreover, the expression of ASRs in Agol 
IP and Ago2 IP LI 23 6 cells by use of the next-generation 
sequencing demonstrated that ASRs were selectively 
incorporated into Agol, indicating that selective incorp- 
oration of these sRNAs into Agol is not restricted to cells 
infected by EBV (Figure 4e), which was again confirmed 
by the real-time PCRs (Figure4e). 

ASRs in cells are stabilized by Agol proteins 

The expression of ASRs was not altered by RNase treat- 
ment after Agol IP, suggesting that ASRs bind specifically 
to Agol (data not shown). In addition, ASRs are highly 
dependent on Agol. When Agol was knocked down, their 
expression decreased dramatically, while that of Ago2 was 
unchanged (Figure 4f left). In this condition, ASRs expres- 
sion was determined by real-time PCR. In Figure 4f, right, 
the expression of miR-21, which is incorporated into Agol 
and Ago2 equally, was downregulated by half when Agol 
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Figure 1. Equivalent amounts of human and EBV-encoded miRNAs associated with Agol and Ago2. The short RNAs that were co-immunopre- 
cipitated with Agol and Ago2 derived from L591 cells were analyzed, (a) The ratio of sRNAs mapped to the human genome and the EBV genome. 

(b) Equivalent amounts of human miRNAs associated with Agol and Ago2. The top 50 highly associated human miRNAs were plotted. 

(c) Equivalent amounts of EBV-encoded miRNAs associated with Agol and Ago2. The EBV-encoded miRNAs, co-immunoprecipitated with 
Agol and Ago2, with reads of > 10 000 counts (42 miRNAs) were plotted, (d) The ratio of sRNAs mapped to miRNA-registered loci, non- 
miRNA nonannotated loci, snRNA, snoRNA, mt-tRNA and the other noncording sRNA loci. 



was knocked down. Expression of ASR2 and ASR3 in 
Agol -knockdown cells was reduced to one-fifth that of 
the control. Most strikingly, the expression of ASR1 and 
ASR4 in Agol -knockdown cells was reduced dramatically 
to almost below the detection limit. On the other hand, the 
amounts of ASRs did not change in mild Ago2 knocked 
down L591 cells (14) (Supplementary Figure S2). 
Free sRNAs are unstable in the cell and are digested 
rapidly (15). Therefore, ASRs are predominantly 
associated with Agol in a highly specific manner, which 
prevents ASRs incorporation by Ago2, 3 or 4. This was 
confirmed by determining the expression levels of sRNAs 
associated with Ago2 when Agol was knocked down. 
Expression levels of some were increased slightly in the 
Ago2 RISC complex, while others were unchanged 
(Supplementary Table SI). These results indicate that 
the majority of ASRs are incorporated preferentially 
into the Agol-RISC complex. 



ASRs can silence targets, suggesting that ASRs are 
potentially functional 

To determine their regulatory roles, we searched for 
putative targets of four abundantly incorporated 
ASRs and isoforms. In silico prediction of the 
binding sites of the 3' untranslated region (UTR) of 
ASR2 and ASR4 by miRanda v3.3, which are highly 
downregulated in Agol -knockdown L591 cells, are 
listed in Supplementary Figure S4. Among them, 
Family with sequence similarity 22 (FAM22) and 
Myocyte enhancer factor 2B (MEF2B) expression in 
Agol -knockdown L591 cells was upregulated 2- and 
4-fold, respectively (Supplementary Figure S4). ASR2 
and ASR4 target sequences of FAM22 and MEF2B 
were shown in Figure 5a. The results were validated 
by assessment of the capacity to direct functional re- 
pression, using a luciferase assay. 
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Figure 2. The sRNAs selectively associated with Agol are derived from tandem loci, (a) Definition of tandem loci and sRNA counts, which were 
mapped to the loci. When one locus (black arrow) with a read count of >10 reads and both flanking loci (gray arrows) located within 100 bases from 
the original locus also have >10 reads, (b) Representative loci are shown. Some Agol associating sRNAs are aligned to 228 776000-228 783 000 bp 
area of chromosome 1 and 148 679 000-148 686 000 bp area of chromosome 7. The areas contain ASR3, ASR4 and many tandem locus. Black reads 
are aligned to forward strand, and gray reads to reverse strand, (c) Some unique sRNAs are selectively associated with Agol. The counts of sRNAs 
associated with Agol and Ago2 (>2000 counts), derived from tandem loci, are listed, (d) The sequences of representative ASRs are shown, (e) 
Associations of ASR1, 2, 3 and 4 with Agol or Ago2 were determined by real-time PCR to validate the next-generation sequencing data. The data 
were normalized by the amount of RNA. Black bar indicates Agol -associated RNA expression; white bar, Ago2. 



We cloned the FAM22 and MEF2B 3' UTRs into a 
luciferase reporter vector and found that exogenous 
ASR2 and ASR4 repressed reporter activity, while the 
mutants had little or no effect (Figure 5a). This confirmed 
that ASR2 and ASR4 negatively regulate the mRNA ex- 
pression of FAM22 and MEF2B directly through the 3' 
UTR. The results indicate that these ASRs function in 
similar ways to miRNAs. 



Moreover, the binding locus of ASRs in 293T cells was 
comprehensively investigated by use of the data sets pre- 
viously reported (10). New technique for ligation and 
sequencing of miRNA-target RNA duplexes associated 
with human Agol, known as 'CLASH', revealed that 
the majority of ASRs bind not only to mRNA 3' UTRs 
but also to coding regions (CDS) and 5' UTR (10). 
The library (-18 000 high-confidence miRNA-mRNA 
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ASRs Sequence (5' > 3') 



ASR1 GflGAAAGCX^ 

ASR2 GGOGGTOQGAT^ 

ASR3 ffQjGTOg/O^ I fD I IGI ItJCTOX OJGG^^ 

ASR4 G^QJJGTOCTA^ 




Figure 3. Processing of ASRs is independent of DROSHA. (a) Sequences of precursor of ASR transcripts are shown. That of ASR1 is 
ENST00000387449; ASR2, NR_004393.1; ASR2, NR_004392.1; ASR4, NG_032096.1. Underlined sequences indicate mature ASRs. (b) Structures 
of ASR1, 2, 3 and 4 were predicted with sequences indicated in Figure 3a by the mfold web server. Highlighted regions indicate mature ASRs 
sequences, (c and d) Control siRNA (Ctrl.) or siDROSHA were transfected into L591 cells by Neon™ transfection system, (c) Expressions of 
DROSHA was evaluated by real-time PCR 3 days after the transfection. Expression levels relative to Ctrl, were normalized with GAPDH and 
shown, (d) Cells were cultured for 6 days, and then total RNAs were extracted. Amounts of ASRs and mature miR-21 were determined by real-time 
PCR. Data were shown as relative expression to the Ctrl, normalized with RNU6. Error bars indicate SD. * J P<0.05. 



interactions) contained the binding locus of top 20 abun- 
dantly expressed ASRs, which revealed 90 loci. Among 
them, centromere protein B (CENPB), leptin receptor 
(LEPR), Muscleblind-like splicing regulator 1 (MBNL1), 
pygopus family PHD finger 2 (PYG02)and Testis-specific 
kinase 2 (TESK2), which have the putative target binding 
sites of ASRs, were upregulated in Agol knockdown L591 
cells (Figure 5b). These results suggest that it is highly 
possible that these five genes are regulated by ASRs in 
L591 cells as well. 

Expression of ASRs in the EBV life cycle 

We showed that many EBV-encoded miRNAs were ex- 
pressed in EBV-latent-infected cells (Figure la). During 
acute EBV infection, the virus sequentially expresses its 
entire repertoire of genes, producing a lytic infection, 
while few genes are expressed during latent infection. 



Recently, it was reported that murine gammaherpesvirus 
68 (MHV68) infection could induce and upregulate endo- 
siRNAs from short interspersed nuclear elements in 
murine cells during lytic infection (16). Therefore, 
linkage of the EBV life cycle with ASRs expression was 
investigated. 

Akata is a type 3 latent-infected cell line derived from 
an EBV-positive Burkitt's lymphoma patient. The lytic 
phase was induced by Akata membrane IgG and anti- 
IgG cross-linking. When the lytic phase was induced, the 
cells underwent apoptosis. According to our results, ex- 
pression level of ASR2, 3, 4 and 5 was linked with apop- 
totic cell numbers (Figure 6a). When the >60% Akata 
cells were dead, ASRs were increased > 30-fold 
compared with latent-phase Akata cells. Moreover, 
when the lytic phase was induced with AGOl knocked 
down cells, the expression of ASRs was decreased 
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Figure 4. Characterization of ASRs. (a) The 5' base of the top 500 abundantly expressed ASRs. (b) The composition of the top 500 abundantly 
expressed ASRs. (c) The consensus motif of top 500 abundantly expressed ASRs. (d) Expression of representative ASRs in HeLa, THP1, L591, 
L1236 and PBMCs were analyzed by real-time PCR, normalized by GAPDH (n = 3). (e) Associations of ASR1, 2, 3 and 4 with Agol or Ago2 in 
LI 236 cells were determined by next-generation sequencing and real-time PCR normalized by the amount of RNAs. The read counts of the ASRs 
listed in Figure 2c in Agol-IP and Ago2-IP L1236 cells are summarized. Error bars indicate SD. *P < 0.05. (f) mRNA expression of Agol and Ago2 
in Agol -knockdown (black bar) and control (white bar) L591 cells (left). Expression of representative ASRs under each condition. Both data were 
normalized by GAPDH. 
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Figure 5. ASRs can bind and silence targets, (a) MEF2B and FAM 22 were regulated by ASR2 and ASR4 though their 3' UTR. The putative target 
sequence of ASR2 and 4 in MEF2B and FAM22-3' UTRs (left). Reporter vectors were constructed by inserting MEF2B and FAM22-3' UTRs into 
Renilla luciferase in a psiCHECK2 vector. Renilla to firefly luciferase ratios are shown (right). All binding sites caused ASR2- and 4-dependent 
downregulation but not control or mutant (agggggacuaaauuugacgg) (*P < 0.05). (b) The sequence-specific binding of ASRs at the CDS, 3' UTR and 
5' UTR of the target mRNAs in 293T cells, which were revealed by CLASH and downregulated in Agol knockdown L591 cells. The five genes, 
which have putative target sequences of ASRs in the mRNAs, were bound by ASRs in 293T cells, and upregulated by Agol knockdown in L591 cells 
are listed (upper). The putative target sequences of ASRs in the target mRNAs are shown (lower left). Overview diagrams of ASRs-mRNA chimera 
preparation (lower right). AGO 1 -associated sRNAs including ASRs and target mRNAs were cross-linked to PTH-AGOl by UV exposure. The 
protein-RNA complexes were purified. mRNA 5' ends were phosphorylated with PNK treatment and ligated to associated ASRs. Finally, 5' and 3' 
adapters were ligated to the chimeric RNAs for next-generation sequencing. 
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Figure 6. Expression of ASRs is correlated with the EBV life cycle. Akata cells were stimulated with anti-human IgG antibody and induced into the 
lytic phase. The cells were harvested at each point and total RNAs were purified, (a and b) The expression levels of total ASRs were measured by 
qPCR and calibrated with GAPDH. (a) ASRs relative expression levels are shown as fold-expression changes compared with that of the latent phase, 
(b) Akata cells were transfected with/without siRNA against AGOl by Neon system (siAGOl and Ctrl, respectively). After 2 days, cells were 
stimulated with/without anti-human IgG antibody to induce EBV lytic phase. After 2 more days in culture, cells were harvested and relative ASRs 
expression to Ctrl was measured by qPCR. Ctrl and siAgol indicate latent phase; Ctrl+IgG and siAgol+IgG, lytic phase. *P<0.05. (c) The cell 
numbers at harvested points were counted. The dead cell ratios were calculated by 1 — (IgG stimulated cell numbers/nonstimulated cell numbers), 
(d) The mRNA expression levels of ASRs' target genes were calculated by qPCR and calibrated with GAPDH. Relative expression levels are shown 
as fold-expression changes compared with that of the latent phase. Ctrl indicates latent phase; low, <40%; middle, 40-60%; high, >60% dead cell 
ratio. The data are shown as means and SD. 



(Figure 6b). Under the condition, the apoptotic cell 
numbers tended to be upregulated (Figure 6c). These 
results indicated that in the lytic phase, the ASRs are sig- 
nificantly upregulated depending on AGOl. 

The miRANDA predicted target genes of these sRNAs 
in L591 cells were listed (Supplementary Figure S4). 
Among them, we focused on INTS5, MEF2B and 
EIF2AK3, which are involved in viral reactivation, and 
analyzed their correlation with the EBV life cycle in Akata 
cells by qPCR (Figure 6d). The lytic phase was categorized 
by dead cell ratio into: low, <40%; middle, 40-60%; high, 
>60%. Expression of INTS5 and MEF2B showed a 
tendency to be decreased especially in middle level of the 
cell apoptosis. Interestingly, the expression of all these 
three genes tended to increase in high dead cell ratio 
(Figure 6d). 



DISCUSSION 

In Drosophila, siRNAs and miRNAs are actively sorted 
into functionally distinct Ago-RISCs based on differences 
in structure (17,18). Perfectly matched duplexes are pref- 
erentially incorporated into Ago2, whereas duplexes with 
central mismatched bulges (miRNA-like) are sorted to 
Agol (17,18). A similar sorting mechanism exists in 



Caenorhabditis elegans, whereby sRNA duplexes with per- 
fectly matched or bulged stems are channelled into RNAi- 
defective 1 (RDE-1) or Argonaute (plant)-like gene 1 
(ALG-1), respectively (19). 

In mammals, the regularity of miRNA sorting onto 
distinct Ago proteins is poorly understood. Mammals 
have four Agos (Ago 1-4) that are involved in the 
miRNA pathway. Among them, Ago2 is unique and 
possesses the sheer activity that mediates the cleavage of 
perfectly matched targets for miRNAs and siRNAs (20). 
When individual Agos are constitutively ablated in mice, 
only the loss of Ago2 causes embryonic lethality, whereas 
single loss of Agol, Ago3 or Ago4 is dispensable for 
animal development. However, RNA-sequencing of 
Agol-, Ago2- and Ago3-associated miRNAs revealed 
that some have a bias toward particular Ago proteins (21). 

We investigated the effect of viral infection on the 
sorting of sRNAs onto Ago proteins and identified 
sRNAs derived from the human genome and incorporated 
exclusively into Agol (ASRs). 

The 5' ends of the ASRs in are mostly C, G or A; 
uridine represents only a minor portion, implying that 
their binding proteins differ from those associated with 
canonical miRNAs (16). 

In Drosophila, several studies have demonstrated that 
miRNA sorting onto Ago proteins is coupled to strand 
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selection and depends on specific structural and sequence 
criteria, in which the sRNAs in the Agol protein complex 
with 5' cytosine are more stable than those with 5' uridine 
(13).The fate of a miRNA/miRNA* duplex, therefore, 
depends on multiple factors, including the structure of 
the duplex and the thermodynamic stability of the ends 
of the duplex (22). The importance of each factor in the 
sorting decision in mammals is as yet unknown. 

We found that several ASRs harbor motifs that are 
preferred at locations on both their 5' and 3 r ends. 

The motifs on the 5' ends of ASRs predict the target 
sequences that are commonly bound by ASRs. The 
common target sequence located in 3' UTR of tumor sup- 
pressor genes has been recently reported, which is puta- 
tively bound by oncogenic miRNAs (23). The putative 
target sequence complementary to the motif on the 5' 
ends of ASRs is completely different from the common 
target sequence bound by oncogenic miRNAs. It seems 
likely that the motif on the 5' end of ASRs has some 
other regulatory functions. 

The biogenesis of ASRs is independent from Drosha. 
They are incorporated in Agol -RISC, bind mRNAs and 
work in similar way to miRNAs. Mirtrons also works as 
miRNA, which regulate RNA via a multistep process 
involving intron splicing and debranching, exosome- 
mediated trimming of the 3 -tail and dicing. Top 20 
highly expressed ASRs are coded within intergenic or 
transfer RNAs but not intron (24). 

Accordingly, ASRs and Mirtron should be separately 
categorized. Still their biogenesis and functional mechan- 
ism seem to be overlapped. Further studies about the com- 
parison of two kinds of sRNAs are needed. 

Recently, 3' motifs of miR-29 have been reported as 
nuclear transport signal (25). The 3 f common sequence 
of ASRs, which is not similar to the 3' motif of miR-29, 
might be involved some other function such as transport 
and stability. Most importantly, whether both of the 
motifs are involved in the Agol/Ago2 sorting decision 
needs further investigation. 

ASRs regulate expression of their targets such as 
FAM22 and MEF2B by binding to the 3 f UTR in L591 
cells. Therefore, ASRs have at least some functional simi- 
larity to miRNAs. Recent comprehensive analysis of 
sRNAs and their binding target RNAs, known 
'CLASH', revealed that ASRs were incorporated into 
Agol -RISC and bound to several mRNAs in 293T cells 
(10). Intriguingly, the data revealed that ASRs were also 
shown to bind mostly to CDS. Several genes, including 
CENPB, LEPR and PYG02, bound by ASRs in Agol- 
RISC in 293T cells had putative binding sites of ASRs and 
were upregulated by Agol knockdown in L591 cells, 
which suggest that these genes are highly possibly 
regulated by ASRs in L591 cells. CENPB, centromere 
protein B, is a DNA binding protein that recognizes 
CENP-B box in the centromeric alpha satellite DNA 
(26). The protein has an important role in assembly of 
kinetochore structure. In apoptotic cells, the protein is 
characterized as autoantigen that is cleaved to 60-kDa 
fragment and observed in blebs (27). It implies that 
ASRs may reduce autoimmune response in apoptotic 
phase. 



By contrast, LEPR and PYG02 are reported as anti- 
apoptotic elements. LEPR is known as the regulation of 
fat metabolism and hematopoiesis. Leptin activate Janus 
kinase/Signal transducer and activator of transcription 
(JAK/STAT) pathways via LEPR, which result in 
enhanced expression of anti-apoptotic factor, B-cell CLL/ 
lymphoma 2 (BCL-2) (28). PYG02 is a component of Wnt/ 
beta-catenin transcriptional complex. Overexpression of 
PYG02 reduces apoptotic activity in human carcinoma 
cell lines (29) 

Together with our results in EBV lytic infection, ASRs 
might be involved in apoptosis. The significance and 
function of ASRs, including their binding to CDS 
warrants further investigation. 

Finally, we analyzed the linkage of expression of ASRs 
with EBV infection status. Interestingly, the expression of 
ASRs was dramatically upregulated during lytic EBV in- 
fection. Small RNAs upregulations during lytic phase 
were consistent with the case of MHV68 infection (16). 
The gene expression pattern is dramatically changed 
between virus infectious latent and lytic phase. ASRs 
might be critical for this phase conversion. When Agol 
was knocked down, ASRs were downregulated and much 
more cells tended to be induced into apoptosis. ASRs de- 
pending on Agol might suppress apoptosis, terminal 
phase of lytic infection. The hypothesis was further 
evaluated by investigation of the target genes of ASRs 
involved in EBV reactivation. It is presumed that as 
more ASRs were expressed, there would be a correspond- 
ing decrease in the expression of the target genes. To the 
contrary, the general trend of the three genes supposed to 
be involved in EBV reactivation was upregulation correla- 
tive^ to apoptosis levels (Figure 6d). 

INTS5, integrator subunit 5, is a member of a 
multiprotein complex associated with the C-terminal 
repeats of RNA polymerase II, and mediates small 
nuclear RNA 3'-end processing (30). ASRs may reduce 
INTS5 function and suppress EBV-encoded miRNA pro- 
cessing. EIF2AK3, eukaryotic initiation factor 2-alpha 
kinase 3, is a type I endoplasmic reticulum (ER) trans- 
membrane protein (31). It contains a stress-sensing 
domain. When unfolded proteins are sensed in the ER, 
protein synthesis is decreased, resulting in apoptosis 
(32). However, EIF2AK3 also has reported pro-tumori- 
genic properties (33). ASRs may function as a balancer of 
EIF2AK3 function, but further investigation is needed. 
MEF2B, myocyte enhancer factor 2B, is a transcriptional 
activator. A recent report showed that MEF2B (34) con- 
tributes to EBV-infected cell reactivation from the latent 
stage. Together with the upregulation of ASRs in the lytic 
infection, the result suggests that ASRs might have some 
functions through the posttranscriptional regulation of 
these target genes to prevent the lytic infection from 
'overactivation'. 

In mammals, the Ago 1-4 family sorting mechan- 
ism is unknown. Still, the class of sRNAs select- 
ively incorporated onto Agol, 'ASRs', appear to play 
vital roles in some aspects of mammalian biology. A 
mechanistic analysis of their processing and stability, 
and an extended investigation of the function of the 
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ASRs in EBV-infected cells, needs thus further 
investigation. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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